Challenges for Value-driven Semantic Data Quality Management

نویسنده

  • Rob Brennan
چکیده

This paper reflects on six years developing semantic data quality tools and curation systems for both largescale social sciences data collection and a major web of data hub. This experience has led the author to believe in using organisational value as a mechanism for automation of data quality management to deal with Big Data volumes and variety. However there are many challenges in developing these automated systems and this discussion paper sets out a set of challenges with respect to the current state of the art and identifies a number of potential avenues for researchers to tackle these challenges.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

RDF Implementation of Clinical Trial Data Standards

Clinical trials pose increasing challenges to sponsors and regulators in terms of execution complexity, timing constraints, risk management, and quality of scientific output. We show how the systematic use of clinical trial data standards, combined with the application of model driven semantic technology within the context of a metadata registry can provide a radical but more effective way to s...

متن کامل

The Elephants in the Room: Sex, HIV, and LGBT Populations in MENA. Intersectionality in Lebanon; Comment on “Improving the Quality and Quantity of HIV Data in the Middle East and North Africa: Key Challenges and Ways Forward”

In response to this insightful editorial, we wish to provide commentary that seeks to highlight recent successes and illuminate the often unspoken hurdles at the intersections of culture, politics, and taboo. We focus on sexual transmission and draw examples from Lebanon, where the pursuit of data in quality and quantity is teaching us lessons about the way forward and where we are experiencing...

متن کامل

A Semantic Infrastructure for a Knowledge Driven Sensor Web

Sensor Web researchers are currently investigating middleware to aid in the dynamic discovery, integration and analysis of vast quantities of high quality, but distributed and heterogeneous earth observation data. Key challenges being investigated include dynamic data integration and analysis, service discovery and semantic interoperability. However, few efforts deal with the management of both...

متن کامل

Ontology-based data quality framework for data stream applications

Data Stream Management Systems (DSMS) have been proposed to address the challenges of applications which produce continuous, rapid streams of data that have to be processed in real-time. Data quality (DQ) plays an important role in DSMS as there is usually a trade-off between accuracy and consistency on the one hand, and timeliness and completeness on the other hand. Previous work on data quali...

متن کامل

Process-driven data and information quality management in the financial service sector

Highly regulated sectors face challenges on data and information quality management (DIQM) to conform to increasing regulations. With the financial service sector, as the most highly regulated industry, we are interested in current and future DIQM challenges. For a sustaining improvement, data quality should be managed process-driven. Process-driven data quality management (PDDQM) provides cont...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017